Towards population reconstruction: extraction of family relationships from historical documents
نویسندگان
چکیده
In this paper we present an approach for the automatic extraction of family relationships from a real-world collection of historical notary acts. We retrieve relationships such as husband wife, parent child, widow of, etc. We study two ways to deal with this problem. In our first approach, we identify all person names in a document, generate all potential candidate pairs of names and predict whether they are related to each other using classification techniques where the text fragments that occur around and between two names are sued as features. In the second approach, we train and apply a Hidden Markov Model to annotate every word in a document with an appropriate tag indicating if it is a name, a specified relationship descriptor, or neither of these. Then we look for the names connected to each other via relationship descriptors. We discuss the challenges such as processing raw data, obtaining a sufficient amount of training examples, and dealing with an imbalanced and noisy collection. We evaluate our results for each relationship type in terms of precision, recall
منابع مشابه
Extracting and Organizing Facts of Interest from OCRed Historical Documents
Historical documents contain facts that family history enthusiasts are interested in extracting. In addition to fact extraction, organizing these facts into disambiguated entity records is also of interest. This paper shows how facts from an excerpt of a page in an OCRed book can be gathered automatically with some expert knowledge.
متن کاملA survey conducted to reconstruct Ali-Qapu Transom embellishments
Sheikh Safi mausoleum includes a number of buildings of different periods, which Shah Tahmasb first turned them into a single complex. Later, Shah Abbas amended this complex and added important buildings to it. In general, the great importance of this historical monument is reflected in its relationship with the Safavid dynasty. According to the travelogues and tourists and historians’ writin...
متن کاملA survey conducted to reconstruct Ali-Qapu Transom embellishments
Sheikh Safi mausoleum includes a number of buildings of different periods, which Shah Tahmasb first turned them into a single complex. Later, Shah Abbas amended this complex and added important buildings to it. In general, the great importance of this historical monument is reflected in its relationship with the Safavid dynasty. According to the travelogues and tourists and historians’ writin...
متن کاملInvestigating the Effects of Social Networks on Family Relations from the Viewpoints of Teachers in 11th Region of Education Ministry
Today, the number of people using social networks is increasing. Meanwhile, the most important effect of these networks is on the quality of family members' relationships because it is considered as a tool that can be very effective on the relationships of family members with each other. Therefore, the purpose of this study was to investigate the effect of social networks on family relationship...
متن کاملExtension Workers’ Attitude towards e-Agriculture: A case study from Bangladesh
e-Agriculture is being the utmost desire for the sustainable development world over. The research was designed to assess extension workers’ attitude towards e-Agriculture in general. The methodology of this study is an integration of quantitative and qualitative methods based on primary data collection. The study was conducted in two upazilas (sub-districts) of Mymensingh district, namely Mymen...
متن کامل